AITopics | optimistic algorithm

Collaborating Authors

optimistic algorithm

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

33d6548e48d4318ceb0e3916a79afc84-Supplemental.pdf

Neural Information Processing SystemsApr-25-2026, 10:29:14 GMT

artificial intelligence, machine learning, probability, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.93)

Add feedback

Information-Theoretic Confidence Bounds for Reinforcement Learning

Xiuyuan Lu, Benjamin Van Roy

Neural Information Processing SystemsFeb-12-2026, 00:21:59 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, information gain, thompson, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.51)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs

Max Simchowitz, Kevin G. Jamieson

Neural Information Processing SystemsFeb-11-2026, 12:08:08 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, dependence, optimistic algorithm, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Alameda County > Berkeley (0.04)
North America > Canada (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.87)

Add feedback

Last-iterateConvergenceinExtensive-FormGames

Neural Information Processing SystemsFeb-9-2026, 10:35:14 GMT

Regret-minimization algorithms are among the most popular approaches to approximate Nash equilibria.

algorithm, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas (0.04)
North America > Canada > Alberta (0.04)

Technology:

Information Technology > Game Theory (0.69)
Information Technology > Artificial Intelligence > Machine Learning (0.69)

Add feedback

33d6548e48d4318ceb0e3916a79afc84-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 04:43:54 GMT

conséquence, min ln 768, probability, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Beyond Value-Function Gaps: Improved Instance-Dependent Regret Bounds for Episodic Reinforcement Learning

Neural Information Processing SystemsNov-21-2025, 14:07:58 GMT

The environment and an agent's interactions are typically modeled as a Markov

algorithm, mdp, state-action pair, (15 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.45)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.83)

Add feedback

Learning Unknown Markov Decision Processes: A Thompson Sampling Approach

Yi Ouyang, Mukul Gagrani, Ashutosh Nayyar, Rahul Jain

Neural Information Processing SystemsNov-21-2025, 08:57:50 GMT

A naive approach to an unknown model is the certainty equivalence principle .

algorithm, artificial intelligence, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Belmont (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.83)

Add feedback

Tractable Instances of Bilinear Maximization: Implementing LinUCB on Ellipsoids

Zhang, Raymond, Hadiji, Hédi, Combes, Richard

arXiv.org Machine LearningNov-12-2025

We consider the maximization of $x^\top θ$ over $(x,θ) \in \mathcal{X} \times Θ$, with $\mathcal{X} \subset \mathbb{R}^d$ convex and $Θ\subset \mathbb{R}^d$ an ellipsoid. This problem is fundamental in linear bandits, as the learner must solve it at every time step using optimistic algorithms. We first show that for some sets $\mathcal{X}$ e.g. $\ell_p$ balls with $p>2$, no efficient algorithms exist unless $\mathcal{P} = \mathcal{NP}$. We then provide two novel algorithms solving this problem efficiently when $\mathcal{X}$ is a centered ellipsoid. Our findings provide the first known method to implement optimistic algorithms for linear bandits in high dimensions.

algorithm, artificial intelligence, machine learning, (14 more...)

arXiv.org Machine Learning

2511.07504

Country: